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ABSTRACT 

Gravitational lenses with anomalous flux ratios are often cited as possible evidence for dark 
matter satellites predicted by simulations of hierarchical merging in cold dark matter cos- 
mogonies. We show that the fraction of quads with anomalous flux ratios depends primarily 
on the total mass and spatial extent of the satellites, and the characteristic lengthscale di/ 2 of 
their distribution. If di/ 2 ~ 100 kpc, then for a moderately elliptical galaxy with a line-of- 
sight velocity dispersion of ~ 250 km s -1 , a mass of ~ 3 x 1O 9 M0 in highly-concentrated 
(Plummer model) satellites is needed for 20% of quadruplets to show anomalous flux ratios, 
rising to ~ 1.25 x 10 10 Mq for 50%. Several times these masses are required if the satellites 
have more extended Hernquist profiles. Compared to a typical elliptical, the flux ratios of 
quads formed by typical edge-on disc galaxies with maximum discs are significantly less sus- 
ceptible to changes through substructure - three times the mass in satellite galaxies is needed 
to affect 50% of the systems. 

In many of the lens systems with anomalous flux ratios, there is evidence for visible 
satellites (e.g., B2045+265 or MG04 14+05 34). We show that if the anomaly is produced by 
substructure with properties similar to the simulations, then optically identified substructure 
should not be preponderant among lens systems with anomalies. There seem to be two pos- 
sible resolutions of this difficulty. First, in some cases, visible substructure may be projected 
within or close to the Einstein radius and wrongly ascribed as the culprit, whereas dark matter 
substructure is causing the flux anomaly. Second, bright satellites, in which baryon cooling 
and condensation has taken place, may have higher central densities than dark satellites, ren- 
dering them more efficient at causing flux anomalies. 
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1 INTRODUCTION 

The abundance of substructure in galaxy halos is emerging as a key 
test in theories of galaxy assembly. In Cold Dark Matter cosmogo- 
nies, dark matter overdensities collapse to form cusped halos, with 
the smalles t and least massive halo s being the d ensest. The sim- 
ulations of Klyp in et ail i 19991) and iMoore et ail ( 1 19991) predicted 
hundreds of small Galactic satellite halos, in contrast to the nine 
then known satellites around the Milky Way. Efstafhioi] dl992l) had 
already suggested that photoionisation may lengthen the cooling 
times of gas in haloes with low circular speeds. This effect sup- 
presses the formation of satellite galaxies, but produces a large 
population of enti rely dark satellites (see e.g. lKravtsov et al]|2004l ; 
IMoore et alfaoOfj) . 

In strong lensing, it has been known for some years that sim- 
ple, smooth models of galaxy lenses usually fitted the image posi- 
tions well, but the flux rat ios of the images poorly. In a bold paper, 
iDalal & Kochanekl §002) argued that the flux anomalies in gravi- 
tational lens systems could be interpreted as evidence for entirely 
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dark substructures. They used 7 of the then available four-image 
lens systems to claim detection of substructure amounting to a total 
mass of 0.6% — 6% of the lens galaxy mass. However, this inter- 
pretation was challenged as flux anomalies could arise from alter- 
native sources, such as absorption, scattering, or scintillation by the 
interstellar medium of the lens galaxy, or by higher order harmon- 
ics in the ellipsoidal models used to fit lens systems, or stellar mi- 
crolensing (see e.g. lEvans & Wittll2003l ; iKochanek & Dalalll2004l : 
iMao et ail 12004). In particular, the surface mass density in sub- 
structure as judged from simulations seems to be lower than that 
required by gravitational lensing, at least within the Einstein ra- 
dius w hich probes primarily the inner parts of halos (e.g. jMao et al.1 
1 20041) . It is also hard to repr oduce the observed statisti cs on cusp vi- 
olations with substructure jMaccio & Mira nda 2006). This argues 
against substructure as a primary cause of anomalous flux ratios. 

Nonetheless, there is good evidence in favour of substructure, 
and, in some cases, visible substructure can be identified. One of 
the gravitational lens systems with a flux ratio ano maly is the radio- 
loud q uadruple CLASS B2045+265 discovered by Fassnacht et al. 
(1999). Recen t deep Hubble Space Telescope and Keck imaging of 
this system bv lMcKean et alj 1 120071) have revealed the presence of 
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a tidally disrupted dwarf galaxy G2. This may be the cause of the 
flux ratio anomaly, although caution is needed as modelling sug- 
gests that G2 must be very highly flattened (q — 0.13). There is 
also evidence for visible structure - po ssibly a small galaxy — in the 
radio-loud quadruple MG0414+0534. ISchechter & Moord J 19931) 
already argued that the perturbation caused by this object may ac- 
count for the relatively poor agreement between the observed data 
on this lens and the theoretical m odels. The quadruple lens systems 
CLASS B1608+656 (see e.g.. iFassnacht et aljfl996h has a loose 
group of galaxies at the same redshift as the main lensing galaxy, 
a phenomen on that also occurs in the six-image radio-loud system 
B 1359+154 teusin et ai]|200ll) . It seems that visible substructure 
may well be responsible for some of the flux ratio anomalies. 

In addition, there have been striking observational develop- 
ments nearer to home. The last two years have seen the discov- 
ery of 10 faint, new Milky Way satellites in data from the Sloan 
Digital Sky Survey (SDSS, see Willman et al. 2005, Zucker et al. 
2006a,b; Belokurov et al. 2006, 2007). It seems likely that a popu- 
lation of ultra-faint, dwarf galaxies does surround the Milky Way. 
These may be representativ es of the "mi s sing s atelli tes", as pre- 
dicted by the simulations of iKlypin et al.l i 19991) and iMoore et al.l 
dl999h . or they may b e a population of tidal d warf galaxies or even 
star clusters (see e.g.. lBelokurov etlti1l2007l) . Such ultra-faint ob- 
jects are only detectable nearby, and so would be - to all intents 
and purposes - dark at the typical redshifts of strong lenses. 

All this suggests that it is worth re-examining the effects of 
substructure on strong lenses. In this paper, we answer the follow- 
ing questions. Given what we know about the satellite populations, 
how frequently might we expect anomalous flux ratios for ellip- 
tical and spiral galaxies? If luminous satellite galaxies represent 
the bright and massive end of a predominantly faint population 
of objects, how frequently might we expect to attribute flux ratio 
anomalies to visible objects? Three (B2045+265, MG0414+0534, 
B 1608+6 56) out of the sample of s ix quadruplets originally iden- 
tified by iDalal & Kochanekf j2002t) as anomalous have optically 
identified companions that are possible causes. At the outset, this 
seems surprisingly high, if the substructure is predominantly dark. 

The paper is arranged as follows. In §2, models of elliptical 
and spiral galaxies are briefly introduced, together with their satel- 
lite galaxy populations. In §3, the frequency with which anomalous 
flux ratios occur is shown to depend primarily on the scalelength 
of the satellite distribution, the mass model used for the satellites 
and the total mass in substructure, while depending only weakly on 
how the mass is distributed between satellites. The simulations re- 
ported in §4 give the expected fraction of anomalous flux ratios for 
both ellipticals and spiral lenses, together with the typical numbers 
caused by high mass and luminous satellite galaxies. 



2 METHODOLOGY 
2.1 Mass Models 

For convenience, we follow I Schneider et"ai1 d 19921) by defining in 
the lens plane the dimensionless distance, dimensionless surface 
mass density and critical surface mass density 
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with D s , Di, D\ s being the distances to the source, lens, and be- 
tween lens and source, and £o an arbitrary scale length which re- 
lates the scaled (uncapped) coordinates to the unsealed (capped). 



The corresponding dimensionless deflection potential is the solu- 
tion of the Poisson Equation 



and the dimensionless bending angle is a = Vip. 



(2) 



2.1.1 Primary Lens 

We examine two models for the main lens galaxy. The first is ap- 
propriate for an elliptical galaxy lens. It is a pseudo-isothermal 
elliptic deflection potential (see e.g. iKassiola & Kovned 1 19931 : 
Hunt er & Evansll 200 ll ; lE vans & Hunter 20o2) 



y) = E T (r 2 c + (1 - e)x 2 + (1 + t)y 2 ) 
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where r c is a dimensionless core radius (the length scale being the 
arbitrarily chosen £o), an d 
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is the dimensionless Einstein-ring radius of the singular isother- 
mal sphere with line-of-sight velocity dispersion a corresponding 
to the r c — 0, e = case. There are two critical curves: a small 
inner one which maps to a 'radial' caustic and an outer 'tangen- 
tial' one which maps to an astroid caustic. At the ranges of e we 
consider, the astroid caustic is wholly within the outer caustic. A 
point source has one image if it is outside both caustics, three if it 
is inside the outer caustic, and five if it is inside the astroid caus- 
tic. Triplets and quintuplets, however, are effectively doublets and 
quadruplets, because one of the multiple images is a highly de- 
magnified central image within the small inner critical curve. For 
concreteness, we consider the elliptical potential to have (unless 
otherwise specified) a velocity dispersi on a = 250 km s - 1 and a 
cor e radius of 100 pc, as s uggested by IKassiola & Kovned d 19931) 
and lEvans & Hun ter ( 2002). We restrict the ellipticity parameter to 
be smaller than e ~ 0.2, otherwise the corresponding surface mass 
density becomes du mbbell-shaped, which i s inappropriate for el- 
liptical galaxies (see lKassiola & Kovnedll993h . 

The second model is appropriate for a spiral galaxy lens. It is a 
three-component model widely used in ga lactic astronomy (see e.g. 
iDinescu et all 1 9991, IFellhauer et al.l2006h as a model for the Milky 
Way. It has a iHemquisM d 19900 bulge, a Miyamoto-Nagai (1975) 
disc and a cored isothermal halo. The Hernquist bulge has 3D mass 
distribution 



2tt f(f + r^) a 



(5) 



where f is the spherical polar radius and rb a core radius. That of 
the halo is 



p(f) = 



(6) 
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where is the core radius and p c the central density. The mass 
distribution of the Miyamoto-Nagai disc is complicated, but the de- 
flection potential in the edge-on case is simple: 

<Ad = \m d log \x 2 + (a + Vb 2 + V 2 f] , (7) 

where ma = Md/(7rE cr £o) is the dimensionless mass and a 
and b control the shape of the distribution. We normalize our disc 
galaxy lens to the M ilky Way, according to the parameters given in 
IShin&Evansl ll2007). Disc galaxies give rise to thr ee main different 
classe s of multiple-image configurations (see e.g. iMoller & Blair] 
1998): 'core triplets' (in effect, doublets), 'disc triplets' (where 
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images straddle the plane of the disc) and quintuplets (in effect, 
quadruplets). The small 7-imaging butterfly cusp in the caustic of 
this e dge-on Milky Way is ignored here (see e.g. IShin & Evansl 
120071) . 

We choose the redshift of the lens to be 0.46 and that of the 
source to be 2.15. These are the median redshifts of known 4-image 
lens systems (see the CASTLES website), omitting those known to 
have more than one main lens and those without known lens and 
source redshifts. We use a flat ACDM cosmology with Q m = 0.27, 
fi A = 0.73, H = 71kms- 1 Mpc" 1 . 



2. 1.2 Phtmmer Model Satellites 

The Plummer mod el is often fitted to observed d warf spheroidal 
galax ies (see e.g. McConnachie & Irwin 2006, Wilkins on et all 
2002). We give our satellites densities 



K(r k ) = 4 fc) (1 + \M) 



(8) 



where rt = y x\ + y\ , and being Cartesian coordinates 
with their origin at the centre of the kth satellite galaxy, is the 

(k) (k) 

central density of that galaxy, and = £o/?p where r p is the 
Plummer model scale radius. The mass of any satellite is 



(9) 



We give a 'typical' 1O 7 M0 Plummer satellite a scale radius (equal 
to its half-light radius) of 140 pc, which is the median half-light 
radius of known Milky Way satellite spheroidal galaxies (see e.g. 
iBelokurov et alj|2007» . and vary r p as y/M p . This mass-radius 
scaling is consistent with those found in A-body simulations, 
where the size of satellites scales as a power of the mass. We also 
put a lower limit on r p of 70 pc, the s mallest known half-lig ht ra- 
dius of a Milky- Way dwarf spheroidal (Belo kurov et alj2007h . 



2. 1.3 Hernquist Model Satellites 

We also examine a second density profile for the satellites, 
mot ivated by numerica l simulations. The NFW density pro- 
file jNavarroet al.ll 19961) 
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is found to be a good fit to cold dark matter subhalos. Rather than 
using the NFW, which falls off as f~ 3 at large radii and therefore 
has formally infinite total mass, we use the Hernquist density pro- 
file, which is similar to the NFW profile, but has an f~ 4 asymptotic 
density decay. Our Hernquist satellites are less concentrated than 
the Plummers, and we make a lowest order estimate of the effects 
of tidal stripping by truncating them at tidal radii r t ' fe ' defined by 
the condition 



(11) 



where f is the radial distance of the kth satellite from the centre 
of the main galaxy, p^ k '(r\^ h ') its mean mass density, and p(r) 
the mean mass density due to the main galaxy enclosed within a 
sphere of radius r when the pseudo-isothermal elliptical potential 
is approximated by a singular isothermal sphere, that is, 
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The truncated Hernquist profile is then 
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Figure 1. Probability density of satellites with respect to the lens-plane po- 
lar radius R. The solid line is for d±/ 2 = 100 kpc, the dashed line for 
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where f k is the radial distance from the centre of the fcth satellite, 
are the untruncated Hernquist masses and r$ are Hernquist 
scale lengths. 

The scale length of each satellite is chosen so that the location 
of the peak circular velocity of the model (which is r s ) varies with 
the mass bound within the tidal radius M t = Mr, 2 /(r t 2 + r 2 ) as 
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in agreement with the numerical simulations of, for example, Die- 
mand, Kuhlen & Madau (2007). 

If the tidal radius of any Hernquist satellite is less than its scale 
radius, we reject it as being too strongly tidally disrupted to be a 
proxim ated by this mass profile. For example, iMetcalf & Madai 
find that NFW clumps lose their mass rapidly for rt < r s . 
It is beyond the scope of this paper to consider lensing effects due 
to the tidally-stripped matter from dwarf galaxies. 



2.1.4 Spatial Distribution of Satellites 

The distribution of satellite galaxies is taken as spherically sym- 
metric. Explicitly, we assume that the number density in the lens 
plane is 



n(R) oc (R 2 + r\) 



1/2 



(15) 



where m > 1 determines the asymptotic fall-off of the distribution, 
and rd is a central softening parameter. We choose m = 5/2 so that 
the three-dimensional number density, obtained by Abel deprojec- 
tion of j 1 5b . falls off like (distance) - 3 5 , similar to the behaviou r 
observed in the Milky Way (see e.g.. I Wilkinson & Evansl IT"999h . 
We choose two different rd, so that half the satellites are within a 
(spherical polar) radius di/2 of 100 kpc or 50 kpc respectively of 
the centre of the main galaxy. The distributions are plotted in Fig.Q] 
Note that, at least as judged from the case of the Milky Way, we ex- 
pect half the satellites to lie within 100 kpc, and so di/2 ~ 100 kpc 
is perhaps the more realistic. 
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Figure 2. The proportions F4 of quad (circles) and F2 of doublet (squares) 
lens systems that have a flux ratio changed by 5% by Plummer satellite 
galaxies, as a function of the mass per satellite galaxy. The total mass in 
satellites, M sa t, is fixed at 5 X 10 9 Mq and the scalelength of the distribu- 
tion at = 100 kpc (solid points and line) and di/2 = 50kpc (open 
points and dotted line). The lens is a pseudo-isothermal elliptic potential 
with e = 0.1. Note especially that F4 is only weakly dependent on how 
the mass is shared between satellites. The error bars are 2<r of a binomial 
distribution. 

2.2 Numerical Details 

For a given source position, the lens equation can be solved numer- 
ically by first triangulating the image plane, mapping the grid to 
the source plane to triangulate the image positions, and then using 
a multi-dimensional Newton-Raphson procedure. 

The satellite galaxies have small effects on the positions of the 
images and on critical curves and caustics. Their effect is mainly 
on the ratios between image fluxes. When a satellite galaxy is suf- 
ficiently near an image, it changes the magnification of that im- 
age. Only if the separation between the satellite and the image is 
still smaller does image splitting occur. In this paper, we focus on 
the effect of the satellites on the flux ratios of lenses rather than 
the image multiplicity. When images split, they are near new criti- 
cal curves created by the satellite galaxy and are highly magnified, 
which certainly results in a flux ratio change. 

We generate positions of sources randomly to find 1000 five- 
image and 1000 three-image systems. For each system, the image 
positions and magnifications are found numerically, both for the 
main galaxy alone and with satellite galaxies at positions randomly 
generated according to the distribution H5\ (or its 3D deprojection, 
for the purposes of finding tidal radii). We count the number of 
systems where the satellite galaxies change the ratio of any two 
image fluxes by 5% or more (discounting the usually unobservable 
central images). 



3 THE TOTAL MASS IN SATELLITES 
3.1 Plummer Satellites 

For the moment, let us fix the total mass in satellite galaxies M aat 
as 5 x 1O 9 M0 and share it equally among a varying number N 
of satellites. The lens galaxy is an e = 0.1 pseudo-isothermal 
elliptical potential, and the satellites are spatially distributed with 
di/2 = 100 kpc. The results are shown in Fig. [2] which illustrates 
that, over a large range of M p or N, the proportion F4 of quads 
with flux ratios affected by the satellites is only weakly dependent 
on how mass is apportioned between the satellites. That is, the pro- 
portion F4 is mainly sensitive to the total mass Af aat = NM P 



rather than N or M p . The solid line in Fig. [2] suggests that satel- 
lites of mass 10 9 M Q are only 50% more efficient at altering flux 
ratios than those of mass 1O 7 M0. The Plummer scale radius r p , 
if set differently, can affect the results. For example, fixing r p to 
140 pc for all M p causes Fi fall off rapidly below M p = 10 7 M o , 
as the central density of the satellites decreases. 

Fig. [2] can be understood qualitatively as follows. The mag- 
nification of an image is changed by ^ 5% - and the flux ratios 
of the lens system affected - if the image is within a 'radius of 
influence' r e g of a satellite. Here, r e g depends on the location and 
magnification of the image, as well as the mass of the satellite. (r e ff 
increases with the mass of the satellite and the magnification of the 
image. Note that there can be an ambiguity in r c ff. As a satellite 
moves closer to the image, the image may be first slightly bright- 
ened, before dimming rapidly as the satellite approaches, or first 
slightly dimmed before brightening rapidly. So, for a large enough 
satellite, there may be two ranges in which the flux changes by 
^ 5%. For the purpose of our rough argument here, however, we 
ignore this complication.) For a given configuration, the probability 
that the flux ratio is changed appreciably by satellites depends not 
only on the number of satellites and their probability distribution 
in space, but on the individual values of r e ff. Even so, ensemble 
averaging over many image configurations of the same type (e.g. 
quads) yields an r c ff that depends only on the mass of the satellite. 

The fractions of quads and doublets that are affected by satel- 
lites depend on the fractional area A sat of the lens plane that lies 
within the circles of influence. That is, 

1 - F 4 « (1 - ^ sat ) 4 => F 4 ~ 4yl sat - 6A s 2 at , 

1 - F 2 « (1 - ^ sat ) 2 => F 2 w 2A sat - Aj at , (16) 

where r e fr,-A S at are different for quads and doublets. If j4 sa t is 
small, the circles of influence do not overlap, so 

JV 

A sat OC Y. nr cB 2 ■ ( 17 > 

The gently increasing Fi in Fig.|2]over a range of M p reflects a 
dependence r 2 ff oc for p > 1, For di/2 = 50 kpc, the propor- 
tional increase in F4 between, say, 1O 7 M0 and 1O 9 M0 (see dotted 
line in Fig. [2} is less than for di/2 = 100 kpc because of greater 

overlapping (A sa t increases more slowly than YliLi 7rr off 2 )- Dou- 
blets are much less strongly affected than quads not only because 
there are only two rather than four images whose fluxes could be 
affected (hence the 2j4 sat rather than 4 J 4 sat term in d!6t). but, more 
importantly, because their images are typically of much lower mag- 
nification. 

Let us now allow the total mass M sat in satellite galaxies to 
vary. Since Ft are not quite independent of how M sa t is appor- 
tioned between satellites, we need to allow for different individual 
satellite masses. We draw them from an M~ 2 distribution with cut- 
offs at 5.0 x 10 6 M© and 5.0 x 10 9 M Q , and vary N to vary M sat . 
(Cutoff masses of below 5.0 x 10 6 were too computationally ex- 
pensive because many more satellites would have been needed for 
the same total masses.) The masses of the satellites (along with 
their positions) are regenerated for each new source position. The 
results are shown in Fig. [3] where Fi are plotted as a function M p 
on a log-log scale. The scalings l | 1 6t and (17} can be seen: Fi ini- 
tially increase <linearly with M sat (as A aat oc N oc A/ sat for a 
given satellite mass function), until the overlaps between circles of 
influence can no longer be neglected, after which Fi approach unity 
asymptotically. The proportions Fi go from 20% to 80% over about 
one order of magnitude of M sat (this can be seen even more clearly 
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Figure 3. The proportions Fj of quads and doublets (upper and lower 
curves) as functions of the total mass M sa t in Plummer satellites drawn 
from the M~ 2 distribution, on a log-log scale. Here, di/ 2 = 100 kpc; 
similar behaviour holds for other di/2 (not illustrated). The dotted lines are 
of unit gradient: when the percentage of affected systems is low, Fi scales 
as < Maat . 

from Fig. [6] in the next section). At all M sa x, flux ratios of quads 
are much more likely to be affected than those of doublets. 

3.2 Hernquist Satellites 

Since the bound mass Mt of a Hernquist profile satellite depends, 
through dl It and dl4b on its initial mass M and its distance from 
the main galaxy (generated randomly in this simulation), we cannot 
repeat ^3.1| exactly. We can only fix the total initial mass, apportion- 
ing it equally between the varying number of satellites N. More- 
massive satellites, which are therefore less concentrated, are more 
prone to tidal disruption: for di/a = 100 kpc, a total initial mass of 
1.0 x 1O 1O M results in a total tidally bound mass of ~ 7x 10 9 M o 
if shared equally between 1000 dwarfs of initial mass 10 7 M Q ,and 
~ 6 x 10 9 M Q if shared between 10 of initial mass 1O 9 M . For 
di /2 = 50 kpc these fall to ~ 6.3 x 1O 9 M and ~ 5.7 x 1O 9 M 
respectively. Even more significantly, the most massive satellites 
cannot stay bound close-in to the main galaxy (whereas smaller 
satellites can), decreasing their chances of lying near the centre of 
the lens in projection, where the images typically are. This is dra- 
matically illustrated in Fig. [4] where for di/ 2 = 100 kpc the pro- 
portion of systems with altered flux ratios only varies weakly with 
N, whereas for d 1/2 = 50 kpc satellites with Mt < 5 x 10 7 M o 
are much more efficient at changing fluxes than massive ones. How- 
ever, this result depends on the assumption that a satellite which is 
highly tidally disrupted (r t < r s ) can be ignored, on the grounds 
that its mass surface density is so diffuse that it has little effect on 
the lensing fluxes. 

Drawing initial masses from the same M~ 2 distribution as 
before, and varying N to vary the total bound mass M sa t > we ob- 
tain Fig. [5] which is the analogue of Fig. [3] The Hernquist profile 
satellites, which are more extended and diffuse than the Plummer 
models, are less efficient at altering fluxes. 



4 ASTROPHYSICAL APPLICATIONS 
4.1 Elliptical Galaxy Lenses 

Let us consider several different sets of parameters of elliptical po- 
tentials ([3} and ask what mass in satellites is required for there to be 
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Figure 4. As Fig. [2] but with an initial total mass in Hernquist profile satel- 
lites fixed at 1.0 X 10 10 Mq. Fi are plotted against the average tidally 
bound mass per truncated Hernquist satellite. 
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Figure 5. The proportions Fi of quads and doublets (upper and lower 
curves) as functions of tidal mass M aa t in Hernquist dwarfs, d 1 / 2 = 
100 kpc. The dotted lines are of unit gradient. 

a significant probability of flux-ratio changes. We first use Plummer 
model satellites, drawn from the M~ 2 mass function, and again 
vary the number of satellites to vary the total mass. The results are 
plotted in Figure [6] 

For the moderately elliptic case e = 0.1, masses of ~ 1.25 x 
1O 1O M0 are needed for 50% of quadruplets to show anomalous 
flux ratios if d 1 / 2 = 100 kpc, decreasing to ~ 5 x 10 9 for 
di/2 — 50 kpc. The lengthscale of the satellite distribution di/ 2 
has a large effect on both Fi and F 2 . As expected, the higher the 
probability density of satellites in the inner parts of the lens plane 
(where the images lie), the greater the proportion of anomalous 
flux ratio systems. Comparing left and right panels, we see that 
the effect of satellites on flux ratios of quadruplets decreases as e 
increases - and the quadruplet cross-section increases, while the 
mean magnification of quads falls. The proportion of doublets F 2 , 
however, is not noticeably affected by the ellipticity e. 

Changing the redshifts of lens and source affects these results 
only through E cr , on which E T in eq ® and in eq l[8} depend. 
Fi for two sets of redshifts are plotted in Fig. [7] for d\ = 50 kpc 
and e = 0.1. We see that the results are not very sensitive to differ- 
ent redshifts or different E cr . The results are also not very sensitive 
to the core radius of the pseudo-isothermal elliptic potential, al- 
though they are affected by its velocity dispersion a, as shown in 
Fig. [8] We recall that the velocity dispersion controls the Einstein 
radius E T — cr 2 (£oGI] cr ) -1 , with more massive lens galaxies di- 
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Figure 6. Plots of Ff (circles for quads, squares for doublets) as functions 
of total mass M sa t in Plummer dwarfs, for various parameters. The lens is 
a pseudo-isothermal elliptic potential with ellipticity e, while the satellite 
distribution has a characteristic scalelength of di/2 as indicated in the top 
left of each panel. 
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Figure 8. Percentages of systems with flux ratios affected by Plummer 
model satellite galaxies, as a function of the total mass in satellites, for 
redshifts z\ = 0.46 and z B = 2.15, for three different velocity dispersions: 
a = 200 km s" 1 (dashed line), 250 km s -1 (solid line), 300 km s" 1 
(dotted line). The remaining parameters are e = 0.1 and cii/2 = 100 kpc. 
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Figure 7. Proportion of systems with flux ratios affected by Plummer model 
satellite galaxies, as a function of total mass in satellites, for two sets of 
redshifts. Filled points and solid line: z\ = 0.46, z s = 2.15 (corresponding 
to E cr = 2.04 X 1O 9 M kpc -2 ). Open points and dotted line: z\ = 0.21, 
z s = 1.22 (corresponding to E cr = 3.05 X 1O 9 M kpc" 2 ). The first set 
is the median lens- and source-redshifts of known 4-image systems; the 
second z\ and z B are one standard deviation below the median. 

luting the effect of the satellites on flux ratios. A pseudo-isothermal 
elliptic potential with a = 300 kms" 1 requires ~ 2 times as much 
mass in satellite galaxies as a a = 200 km s" 1 model for F A = 0.5, 
or 50% of the quads to have anomalous flux ratios. However, the 
Einstein radius of the main galaxy goes as a 2 , and the projected 
mass within the Einstein ring as a 4 , so a given relative mass in 
satellite companions is more likely to affect flux ratios in more mas- 
sive elliptical lenses. 

When the less concentrated, tidally-stripped Hernquist profile 
satellites are used instead of Plummer models, the effect on flux 
ratios is significantly weaker: Fig. [9]is the analogue of Fig. [6] The 



Figure 9. Effect of Hernquist satellites on flux ratios (analogue to Fig. [6). 

initial masses of the Hernquist dwarfs are drawn from the same 
M~ 2 distribution, and Ft are plotted against the total tidal mass in 
satellites. For the e — 0.1 elliptical potential, if d 1 / 2 = lOOkpc, 
some 4.5 x 10 10 Mq in satellites is needed for 50% of quadruplet 
flux ratios to be altered (over three times the mass in Plummer satel- 
lites needed), dropping only to 2 x 10 10 for di/2 = 50 kpc (about 
eight times the mass in Plummer satellites needed) because of the 
increased tidal stripping of closer-in satellites. 

4.2 Spiral Galaxy Lenses 

Now let us change the lens galaxy to an edge-on spiral, using the 
model based on eqns {5}-l[7]l. Plots of Fi as functions of M sat , for 
Plummer model satellites, are plotted in Fig.[l0]as solid lines along- 
side those for a pseudo-isothermal elliptic potential as dotted lines. 
The upper panel shows the model in which the disc is maximal 
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Figure 10. The upper panel shows Fi as functions of the total mass in Plum- 
met' satellites, for an edge-on Milky Way with maximal disc (solid lines and 
filled symbols) and a pseudo-isothermal elliptic potential for comparison 
(dotted lines and open symbols) with a = 250 km s . The lower panel 
shows the same quantities but for a sub-maximal disc. Both galaxies are 
at z\ = 0.46, lensing sources at z B = 2.15. d 1 /2 = 50kpc. Quads are 
shown as circles, doublets as squares, and disc triplets as triangles. 

and provides most of the rotational support in the inner parts, the 
lower panel shows a model in which the disc is sub-maximal (see 
IShin & Evansll2007l for a detailed discussion of the lensing proper- 
ties of these models). There are three solid lines in the panels, as the 
results are divided according to the five image (in effect, quadru- 
plet), core triplet (in effect, doublet) and disc triplet morphologies. 

Compared to a typical elliptical, the flux ratios of maximal 
disc lens quads are significantly less susceptible to changes through 
substructure - almost three times the mass in satellite galaxies 
is needed to affect half the flux ratios. Spiral doublets, however, 
are slightly more susceptible. This is as expected: the greater 
the asymmetry of the matter distribution, the larger the 4-image 
cross-section, the lower the typical magnification of quads and the 
smaller the effect of satellite galaxies on the 4-image flux ratios. 
(So disc triplets, with typical magnifications in between quads and 
doublets, are also in between in susceptibility to flux changes.) In- 
deed, we do not expect many anomalous flux ratios in disc galaxy 
lenses to be caused by the satellite galaxies in their haloes unless 
these satellites total at least > 1O 1O M in mass, that is, w 10% 
of the luminous galaxy mass! (And that assumes the satellites are 
distributed with d 1( / 2 = 50kpc.) On moving to the sub-maximal 
case, the 4-image and disc triplet cross sections are substantially 
reduced, and so the numbers return towards their values in the el- 
liptical galaxy case. 

Note that we have compared the effects of a typical spiral with 
asymptotic circular speed of vo « 220 km s -1 to a typical ellip- 
tical with line-of-sight velocity dispersion of a = 250 km s . 



This makes sense, as early-type galaxies are more massive than 
late-type. If instead we were to carry out the comparison using 
a — vo/ v2, so that the kinematic properties of the models were 
comparable, then the elliptical would have a lower velocity dis- 
persion and so the discrepancy between the effects of spirals and 
ellipticals would be increased (see Figure[8}. 

4.3 Visible Substructure 

Some flux anomalies appear to be attributable to single dwarf 
galaxies. The most obv ious examples a r e the Sixth Ob- 
ject in MG0414+05 34 dSchechter & Moord Il993l) and G2 in 
CLASS B2045+265 dMcKean et alj|2007l) . In both these cases, a 
single piece of substructure gives a substantial improvement in the 
fit of a smooth model of the lens galaxy. An interesting question is 
- if flux ratio anomalies are due to substructure - how often might 
we expect to see a visible counterpart? At the typical redshifts of 
the lenses, only the very largest dwarf galaxies can be detected with 
ground-based telescopes, of course. 

This question can be answered by simulations in which the 
satellites are divided into two populations - Ni dark satellites of 
low mass Mi and N2 bright satellites of high mass M2 3> Mi - 
and flux-ratio changes are sought for the two populations separately 
and together. Some sample results of the simulations are shown in 
Table[T]for the usual e = 0.1 pseudo-isothermal elliptic potential 
with di/2 = 100 kpc. Note that, for the Plummer satellites, when 
the ratio of the total mass in population 1 to total mass in popula- 
tion 2 is 1:1, then the number of systems with anomalous flux ratios 
caused by population 1 as compared to population 2 is roughly in 
the ratio of 2:3, and a 3:2 mass ratio gives a roughly 1:1 effect. For 
the Hernquist satellites, a 1:1 mass ratio gives 1:1 effect. This con- 
firms the observations of Sj3]that a 10 9 A/q Plummer-model dwarf 
spheroidal is only ~ 50% more efficient at perturbing fluxes than 
a 10 7 Mq one, and more-massive Hernquist dwarfs are no more 
efficient than less massive ones. So from Table Q] we see that the 
most massive satellites do not contribute very disproportionately to 
anomalous flux ratios. 

The neat correspondence between ratios is not seen for 10:1 
mass ratios because F4 increases sub-linearly with mass: e.g. a sin- 
gle 10 9 A/q in Plummer satellite changes fluxes in 12% of quads, 
but four of them change fluxes in 28%, not 48%, of quads. How- 
ever, it is confirmed that ~ 85% of systems with affected fluxes 
still have affected fluxes if the 10% of mass in massive satellites is 
removed. 

The fraction of observed anomalous flux ratio systems with 
visible (and therefore high mass) culprits is actually quite high. 
Of the six four-image lens systems proposed bv lDalal & Kochanekl 
j2002l) , three (B2045+265, MG0414+0534, B 1608+656) have 
identified, visible substructure that may cause the flux perturbation. 
If this datum is taken at face value, it suggests that about half the 
mass in substructure is in dwarf galaxies large enough to be opti- 
cally identified. Simulations, however, tend to find that the satellite 
masses behave more like an M~ 2 distribution. This implies that 
there is equal mass in equal decades, and therefore that each decade 
is responsible for the causing roughly the same number of flux ra- 
tio anomalies (the 10 9 Mq decade causing only 50% more than the 
10 7 M Q decade). 

One resolution of this difficulty is to postulate that the visi- 
ble dwarf has been mistakenly designated as the culprit and that 
the anomalous flux ratio is really produced by another dark satel- 
lite. A hint that this may sometimes be the case is given by the 
unrealistically large flattening deduced for satellite G2 in CLASS 
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Table 1. The proportion F4 of quads with anomalous flux ratios for five runs of the two-population model. Here, the satellite populations have = 100 kpc 
and the elliptical lensing galaxy has the standard parameters. Population 1 satellites have mass 1.0 X 10 7 Mq, while population 2 satellites have mass 
1.0 X 10 9 Mq. (For Hernquist satellites, these are initial masses.) The Plummer model Population 2 contributes to F4 by ~ 50% more than population 1 
satellites, whilst the two Hernquist-model populations contribute equally. 
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ATiMi 


N 2 M 2 


F 4 


F 4 ,i 


F 4 , 2 


TViMi : 


N 2 M 2 


-^4,1 : F4,2 


Profile 


(M@) 


(Mq) 


(Both pops) 


(Pop 1 only) 


(Pop 2 only) 








Plummer 


1.0 x 10 10 


1.0 x 10 9 


43.2% 


37.3% 


12.2% 


10 


: 1 


3.1 : 1 


Plummer 


5.0 x 10 9 


5.0 x 10 9 


45.6% 


21.5% 


34.5% 


1 : 


1 


1 : 1.6 


Plummer 


6.0 x 10 9 


4.0 x 10 9 


44.3% 


23.4% 


28.1% 


3 : 


2 


1 : 1.2 


Hernquist 


2.0 x 10 10 


2.0 x 10 9 


28.4% 


25.1% 


5.1% 


10 


: 1 


4.9 : 1 


Hernquist 


1.2 x 10 10 


8.0 x 10 9 


22.8% 


16.3% 


10.6% 


3 : 


2 


3.1 : 2 


Hernquist 


1.0 x 10 10 


1.0 x 10 10 


20.9% 


13.2% 


12.1% 


1 : 


1 


1.1 : 1 



B2045+265 using model-fitting in lMcKean et alj J2007t) . In other 
words, it may simply be a chance effect that many anomalous flux 
ratio systems appear to have visible objects at or near the Einstein 
radius. The probability that a large dwarf lies close to the Einstein 
radius is easily computed from eq d 1 5 b - When diy 2 = 100 kpc, 
then there is a 10% probability of a large dwarf lying within two 
Einstein radii and 3% within one. This still does not seem large 
enough to explain the effect, but caution is needed as there may be 
other supplies of substructure along the line of sight for some of the 
lenses in groups and clu sters, like B1359+1 54. It is worth noting 
that simulations (see e.g. lZentner et alj|2005t) can sometimes yield 
highly anisotropic distribution of substructure in simulated halos: 
the projected subhalo mass within 10 kpc can vary by factor of 10 
depending on viewing angle. In this case, sightlines which project 
massive satellites onto small radius are also much more likely to 
project other satellites onto small radius. This may mean that our 
computed probabilities of 3-10 % may be on the low side. 

Another possible resolution of the difficulty is that luminous 
satellites, the baryons in which have cooled and condensed, may 
be much more centrally concentrated than dark satellites. In other 
words, it is possible that luminous satellites would have a much 
larger effect on flux ratios than their dark brethren because they 
are structurally different and more compact. Of course, the effect 
of baryons on dark haloes is subject to considerable uncertainties. 
This effect must be small for dwarfs l ike Draco, with a mass-to- 
light ratio of > 350 dKlevna et al. 2001), although it may be more 
significant for satellites li ke the Larg e Magellanic Cloud with a 
mass-to-light ratio of ~ 5 dAlvesll2004l) . It is also known that semi- 
analytic calculations of galaxy formation lead to too many compact, 
luminous satellites, as compared to what is seen around the Milky 
Way (see e.g., Koposov et al. 2007). The bright satellites predicted 
by semi-analytic theories are much too concentrated, suggesting 
that this effect is overplayed in the modelling. 

Nonetheless, the effect certainly exists at some level and is 
worth investigating. Baryon condensatio n may increase the cen tral 
density by a factor of between 4 and 160 JGnedin & Zhaol2002i) . al- 
though the larger numbers are probably more appropriate for galax- 
ies like the Milky Way rather than satellite galaxies. We change the 
lengthscale r s of the Hernquist model to mimic the result of baryon 
condensation and quantify the extra effect, compared to dark Hern- 
quist satellites, that highly-concentrated luminous satellites could 
have on flux ratios. (The Plummer satellites are already so com- 
pact that changing the Plummer scale radius makes no appreciable 
difference even when the central density is raised by a factor 100.) 
The Hernquist density law J X 3 b means that the central density goes 
as 7-7 2 , so we modify the scalelength- to-mass relation d!4t to 



Table 2. The (average) tidal mass Af sa t in Hernquist satellites of untrun- 
cated mass 10 9 Mq needed to affect fluxes in 50% of four-image systems, 
for various q. The elliptical galaxy has the standard parameters and the 
scalelength of the spatial distribution of satellites is d.1/2 = 100 kpc. 



Satellite 


N 


M sat 


Profile 




(Mq) 


Hernquist (dark) 


100 


5.6 x 10 10 


Hernquist (10 X concentration) 


20 


1.5 x 10 10 


Hernquist (100 X concentration) 


12 


1.1 x 10 10 



r B = q lO~V^t/M , (18) 

where q = 1, l/\/l0, or 0.1, corresponding to central densities 
of 1, 10 and 100 times that of the dark (uncontracted) Hernquist 
profile. The number of satellites of initial mass 10 9 Mq needed to 
affect flux ratios in 50% of four-image systems (all other parame- 
ters being the same as in TableQJ is shown in Table|2]. Almost four 
times the tidal mass in massive dark satellites is required, compared 
to massive luminous satellites with 10 times the central density, to 
affect the same proportion of flux ratios. Raising the central density 
a further factor of 10 has a smaller effect. The increasing effect on 
fluxes seen with decreasing r s is amplified by the extra resistance 
to tidal disruption of the more-concentrated satellites. 

Table[3]shows the results of two-population models, in which 
population 1 are dark satellites of initial mass 10 7 Mq and pop- 
ulation 2 are bright satellites of initial mass 10 9 M Q , with 10 or 
100 times the central density of their dark brethren. The fraction of 
anomalous flux ratio systems caused by bright substructure is now 
impressively high - for example, the second line of the table tells 
us that 42 % of systems have anomalous flux ratios, of which 28 % 
remain anomalous when the dark population is removed. In other 
words, over half of the anomalous flux ratio systems are caused, at 
least in part, by bright satellites. This is close to the statistics on ob- 
served systems - although caution is needed as large compression 
factors like 10 or 100 may well cause the importance of this effect 
to be overestimated for satellite galaxies. 

5 CONCLUSIONS 

It remains unclear whether dark matter satellites and substruc- 
ture are responsible for a nomalous flux ratios in strong lensing. 
iDalal & Kochanekl (2002) originally studied a sample of 7 radio- 
loud four-image lens systems and claimed evidence of anomalies 
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Table 3. The proportion F 4 of quads with anomalous flux ratios for three runs of the two population model, with the same parameters as in Table[T]but where 
population 2 satellites have increased central density. As the compression increases (q decreases), the tidally bound mass of population 2 satellites increases, 
but their effect on fluxes increases disproportionately, outstripping the effect of the diffuse low-mass population 1 satellites. The difference caused by a factor 
10 increase in central density is much larger than the difference caused by an extra step to a 100-fold increase. 
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(Me) 


(M ) 


(M @ ) 


(Both pops) 
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7.4 x 10 9 
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23% 
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8 
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42% 


16% 


28% 


Hernquist (Pop 2 with 100 X concentration) 


1200 


8 


7.4 x 10 9 


7.2 x 10 9 


48% 


16% 


36% 



in 6 of them. This seemingly suggests that anomalous flux ratios 
are very common. Here, we have carried out a theoretical study of 
the frequency of flux ratio anomalies as a function of lensing galaxy 
and dark matter substructure parameters. 

The likelihood that satellites affect flux ratios in strong lenses 
depends on their mass profile. Here, we considered compact Plum- 
mer spheres and diffuse, tidally stripped Hernquist profiles for our 
satellites, with the satellite size scaling as a power of mass. As the 
Hernquist satellites are more extended than the Plummer models, 
they therefore have a smaller effect on image fluxes - typically 
about 3 times the mass is needed to generate the same numbers 
of anomalous flux ratios. 

The probability that strong lensing flux ratios are affected by 
satellites is crucially dependent on their spatial distribution. The 
characteristic lengthscale dx/2 has a large effect on the fraction of 
lenses with affected flux ratios. Our spatial distributions of satellite 
galaxies are inspired by the observational data on the Milky Way, 
for which the satellite number density falls off as f ~ 3,5 in three- 
dimensions with a lengthscale of di/2 ~ 100 kpc. For such distri- 
butions, most satellites are too far out to affect the fluxes of images. 
For example, even with Plummer satellites, a mass of~ 3xlO 9 M 
is needed for 20 % of quadruplets to show anomalous flux ratios for 
a typical elliptical galaxy, rising to ~ 1.25 x lO lo A/ for 50 %. 
Lenses that are edge-on spiral galaxies with maximum discs (like 
the Milky Way) are more resistant to flux changes by satellites, so 
the mass in satellites and substructure has to be roughly a factor of 
3 times as great for the same proportion of quads to be affected. 
To obtain anything like the apparent abundance of anomalous flux 
ratios, then the scalelength of the substructure has to be different to 
what is known for the Milky Way satellites. 

Whether the flux ratios in a lens system are affected by satel- 
lites is sensitive to the total mass in satellites, but more weakly de- 
pendent of how this mass is apportioned between them. For Plum- 
mer model satellites, the probability that a given satellite changes a 
flux ratio increases with its mass only slightly faster than linearly, 
at least when its mass is between ~5x 1O 6 M and ~ 1O 9 M . 
For example, satellites of mass ~ 1O 9 M are only responsible 
for the causing ~ 50% more flux ratio anomalies than those of 
mass ~ 1O 7 M . For Hernquist model satellites, more massive 
ones seem no more efficient (per unit mass) at changing fluxes; 
indeed, more massive dwarfs, being more prone to tidal disruption, 
might even be less efficient than lighter, more compact ones. One 
interesting consequence is that, if matter in dark satellites is not 
predominantly in the most massive ones, then the contribution of 
the most massive satellite galaxies to flux ratio anomalies should 
not be predominant. 

In the light of this, the fact that so many anomalous flux ra- 
tios systems have optically identified substructure seems at out- 
set surprising. There seem to be two possible explanations. First, 



the visible substructure may have been wrongly identified as the 
cause, whereas dark substructure may be the true culprit. A large 
dwarf galaxy may by chance be projected close to the Einstein ra- 
dius, whereas unrelated dark substructure may be the major cause 
of the anomaly. Second, visible satellites may be more concen- 
trated than their dark cousins, a physical effect that may naturally 
arise from baryon condensation. Compression factors causing an 
enhancement of the central density by a factor of 10 in bright satel- 
lites seem to be ample to give a satisfactory explanation of the ob- 
served statistics. Nonetheless, such high compression factors are 
probably implausible except for the lar gest satellite galaxies . This 
seems to be in accord with the results of lMaccio et aL U2006l) , who 
found that including baryons in numerical simulations did not help 
in reconciling simulation results with the statistics of anomalous 
flux ratios. 

Finally, we remark that the likelihood that flux ratios are af- 
fected depends on the ellipticity of the main lens galaxy, but this 
dependence is much stronger in quads than doublets. This is a con- 
sequence of high magnification images being more easily affected 
by a dwarf galaxy than low magnification ones. Quads are more 
highly magnified than doublets (the disc triplets of spirals are in 
between), and changing the ellipticity of the main galaxy changes 
the typical magnification of quads more than it changes that of dou- 
blets. Generally, the greater the ellipticity the less the effect of satel- 
lite galaxies. For example, a given mass M sat of dwarf satellites 
around a typical edge-on maximum-disc spiral galaxy is signifi- 
cantly less likely to change image flux ratios than M sat around a 
typical elliptical galaxy, even though the spiral is less massive than 
the elliptical. 
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